Content-independent duration model on categories of voice and unvoice segments
Abstract
An attempt to understand experimental data on the segmentation of a speech signal by the voiced/unvoiced principle led us to the hypothesis of a pair of logistic dependences between the durations of these segments. The segmentation was carried out by a computer program working in quasi-real time. The hypothesis of a logistic recurrent dependence for the sequence of segment durations allowed us to conclude that this sequence has a quasi-rhythmical organization. With the proposed recurrent dependences it is possible to explain the statistical peculiarities of the speech behaviour of stutterers in comparison with normal speech behaviour. These logistic dependences were confirmed by direct experimental data. An assumption is made about the origins of this rhythm: they are hidden at the level of control of speech production and perception. It is shown that the chaotic nature of the proposed dynamics of the formation of large-scale temporal structure allows the concept of information to be introduced in a natural way.
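The abstract does not state the exact form of the recurrent dependence, so the following is only a generic illustration of how a logistic recurrence can produce both quasi-rhythmical and chaotic sequences. It assumes the textbook logistic map x_{n+1} = r * x_n * (1 - x_n) acting on a normalized duration in [0, 1]; the function names, the parameter `r`, and the chosen values are hypothetical and are not taken from the paper.

```python
def logistic_step(x, r):
    """One step of the textbook logistic map (an assumed form, not the paper's)."""
    return r * x * (1.0 - x)


def simulate_durations(x0, r, n):
    """Return n successive normalized 'durations' generated by iterating the map."""
    xs = [x0]
    for _ in range(n - 1):
        xs.append(logistic_step(xs[-1], r))
    return xs


# r = 3.2 settles into a period-2 alternation (a simple rhythm);
# r = 3.9 lies in the chaotic regime of the logistic map.
periodic = simulate_durations(0.2, 3.2, 200)
chaotic = simulate_durations(0.2, 3.9, 200)

if __name__ == "__main__":
    print("last periodic values:", [round(v, 4) for v in periodic[-4:]])
    print("last chaotic values: ", [round(v, 4) for v in chaotic[-4:]])
```

In this toy setting, the alternation between two values at the lower parameter is one way to picture a quasi-rhythmical sequence of segment durations, while the irregular sequence at the higher parameter corresponds to the chaotic dynamics the abstract refers to.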
Similar resources
The Business Model of Sports Academies with an Emphasis on Value Proposition and Customer Segments
Background. Nowadays, sport is considered as a good base for marketing and entrepreneurship. Business model design is also increasingly welcomed. But in the field of sports businesses and sports academies in Iran, no research has been conducted, and no specific business model has been introduced. Objectives. The purpose of this study is to identify and prioritize the value proposition componen...
Comparison of Modeling Target in LSTM-RNN Duration Model
Speech duration is an important component in statistical parameter speech synthesis(SPSS). In LSTM-RNN based SPSS system, the speech duration affects the quality of synthesized speech in two aspects, the prosody of speech and the position features in acoustic model. This paper investigated the effects of duration in LSTM-RNN based SPSS system. The performance of the acoustic models with positio...
The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners
The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...
Voice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
Blind Voice Separation Based on Empirical Mode Decomposition and Grey Wolf Optimizer Algorithm
Blind voice separation refers to retrieve a set of independent sources combined by an unknown destructive system. The proposed separation procedure is based on processing of the observed sources without having any information about the combinational model or statistics of the source signals. Also, the number of combined sources is usually predefined and it is difficult to estimate based on the ...
Publication date: 1998